Telephone Speech Corpus Development at CSLU

Authors

  • Ronald Cole
  • Mark Fanty
  • Mike Noel
  • Terri Lander
Abstract

This paper describes eight telephone-speech corpora at various stages of development at the Center for Spoken Language Understanding. For each corpus we describe data collection procedures, methods of soliciting callers, protocol used to collect the data, transcriptions that accompany the speech data, and the expected release date. The corpora are (or will be) available at no charge to academic institutions.

Related resources

Connected Digit Recognition Experiments with the OGI Toolkit's Neural Network and HMM-Based Recognizers

This paper describes a series of experiments that compare different approaches to training a speaker-independent continuous-speech digit recognizer using the CSLU Toolkit. Comparisons are made between the Hidden Markov Model (HMM) and Neural Network (NN) approaches. In addition, a description of the CSLU Toolkit research environment is given. The CSLU Toolkit is a research and development softwa...

The CSLU speaker recognition corpus

This paper describes the CSLU Speaker Recognition Corpus data collection. The corpus was motivated by a need for speech data from many speakers, under different environmental conditions, with each speaker providing data over a significant period of time. The corpus was designed to provide sufficient data to study phonetic variability within and across sessions, and to design and evaluate system...

A Brazilian Portuguese language corpus development

This article presents the techniques that are being used for the creation of a database related to the Brazilian Portuguese language. This database is composed of a collection of recorded voices, from different speakers and different regions of Brazil. The collected voices contain varied phonetic and phonologic information. The applications of this database are diverse, including synthesis and ...

Tools for Research and Education in Speech Science

The Center for Spoken Language Understanding (CSLU) provides free language resources to researchers and educators in all areas of speech and hearing science. These resources are of great potential value to speech scientists for analyzing speech, for diagnosing and treating speech and language problems, for researching and evaluating language technologies, and for training students in the theory...

Quantitative Analysis of Pitch in Speech of Children with Neurodevelopmental Disorders

We analyzed the prosody of children with Autism Spectrum Disorder, Developmental Language Disorder, and typical development in conversational speech, using the CSLU ADOS speech corpus. We found several significant differences in the pitch characteristics of these diagnostic groups, and report automatic classification utilizing these features that performs well above chance level. We show that the ch...

Publication date: 1998